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positions l to 10: can be deleted 
position 36: G is tolerated 
position 10: R or V are tolerated 
position m-. 5 or N are tolerated 
position A3: A" is tolerated 
position 46: V or N are tolerated 
position 91: 5 is tolerated v 
positions 123 and 156: L are tolerated 
position 223: A and S are tolerated 

Changes that affect enzvme activity (demonstrated) 

POSltlOn 19: L tO S 

position 38: I to S or N 
position 39: E to 5 or R 
position 10: I to 0 
position 12: r to B 
position 44: U to E G or H 
position 45: J to E or D 
position 46: Y to C 
position 47: T to 1? or N 
position 80: U to I 

position 144: D to E pir; K 

position 145: U to E r I O. O 

position 146: G to E 
position 147: S to 5 



Group I Intron Encoded Endonucleases and Related Endonucleases 



ENOONUCLEASE 



RECOGNITION SEQUENCE 



CLEAVAGE 



[ S[TE — ■ V INTRON SITE 



to 



a. 
m 



a: 
o 



I-Sce I 
(Saccharomyces mitochondria) 

I-Sce IV 
(Saccharomyces mitochondria) 

I-Sce H 
(Saccharomyces mitochondria) 

I-Ceu I 
(Chlamydomonas chloroplast) 

I-Ppo J 
(Physarum nucleus) 

I-Sce m 
(Saccharomyces mitochondria) 



cgcitaggga£LaIcagggtaat|atagc 

GCG IAT CCCjT AT T GT CCCAT T A| T AT CG 



T T C T C A T GAtSIg C T C T A AT CC AT G G 
AAG AGT A C|T AA T CG A GATTAGGTACC 



C T T T G G T CAIJLCjA GAAGTATAT ATTT 
G lAA ACCAGjTAGGT CTTCATI ATA TAAA 

taacggt^cia_aJggtagcgaaattca 
att gccagigat tccatcgcttt aagt 



TGACTCT CjLLAj&JGGT AGCCAAATGCC 
AC |TGAGAG|AAT TCCAT CGGf TTT ACGG 



GGAGGTTTTGGTAACTATTTAT TACC 
CCT CCAAAACCATT GATAAATA ATGG 



I-Cre I 
(Chlamydomonas chloroplast) 



GGGTTCAAAACGTCGTGAGACAGTTT 
CCCAAGTTTTGCAGCACTCTGTCAAA 



o 

8 
1 



Endo. See I(RF3) 
(Saccharomyces mitochondria) 
(Non intronic) 

HO 

(Saccharomyces nucleus) 
(Non intronic) 



GAT GCT G TAGJLCJAT AGGCTTGGTTAT 
CTA CGACAITCCGTAT CCGAACCAATA 



CfTT T C C G C AAJLAjG T ATAATTTT ATAA 
GAA AG G C GIT T G T C A T A TT AA A A TATT 



I-Csm I 
(Chlamydomonas mitochondria) 
(Putative endonuclease) 

I-Pan I 
(Podospora mitochondria) 
(Putative endonuclease) 



ACCAT GGGGTCAAAT GTCTTTCTGGG 
TGGTACCCCAGTTT A CAGAAA GACCC 

V 

GTGCCTGAATGAT AT TTATTACCTTT 
CACGGACTTACTAT A AATAATGGAAA 



UJ 

ZD 



§ 



(Bacteriophage T4) 



I Tev I 



CA 
GT 



GCT CAGTAGATGTTTTCTT GGGT C TACCGT T T A AT 
GCGAGT CAT CTACAAAAGAA CC C A G AT GGCA A AT T A 



I Tev n 



I Tev HE 



CAAGCTTATGAGTATGAAGT G A ACACGJJT A T T 
GTTCGAATACTCATACTTCAC TTGTGlCAAT A A 

GCT ATT CGTTT tLtAT GT A T CT TT T G C G T GT AGCT T T A A 
CGA T A A GCAAA A ATjA CATAGAAAACGCACATCGAAATT 
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EXPRESSION VECTORS 

EcoRI EcoRI 
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Soc I Csp6 I Alw I 
HgiA I BstK I Taq I 
Ed 136 I BsaJ I Sal I 
Sph I Rmo I Kpn I BstY I Hinc II 
NspC I Bsp1286 I Ben I Sfe I 
EcoR I Xba I Ban I BamH I Acc I I-Sce I 
Apo I Nlo IH Ban II Ben I Sfe I Rma I 
Taq I Nsp7524 I Alu I Ava I Pst I Hga I 
Alu I Nsp I Bfa I Asp718 Alw I BspW I Bfa I F co R V 

I II II II II II II II I I I I I |J I 

CCMGCTCGMTTCGCATGCTCTAGAGCTCGGTACCCGGGATCCTGCAGTCGACGC |IAGCGAIAACAGGG T AA TACAGAT 23' 
GGnCGAGCTTMGCGTACGAGATCTCGAGCCATGGGCCCT 

2244 2255 2262 2271 2279 ' 2286 ' ' 2296* * 2318 

2247 2255 2266 2275 2284 2292 

2249 2256 2265 2275 2284 2296 

2249 2261 2271 2279 2289 2297 

2255 2265 2276 2284 

2255 2262 2271 2279 2289 
2265 2275 2289 

2265 2275 2290 

2265 2272 2280 
2271 2279 
2272 2280 
2275 
2275 
2275 
2275 
2275 
2275 
2276 
2276 
2276 
2276 
2276 
2276 

2280 
2280 
2280 
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Sau3o I Sse8337 I BamH I Kpn I Apo I 

Mbo I Nlo m Taq I Alw I Bon I Alu I BceF I 

Dpn II Sph I Sal I Rma I Ben I Eel 136 I Hae HI 

Dpn I NspC I Hinc II Mnl I Avo I Bsp1286 I Gdi II 

BstY I Nsp7524 I Ace I Bfa I BsoJ I Ban U Eoe I 

Bgl II Nsp I BspM I Xba I Alw I Asp718 Taq I Bsr I 

II III M III III M ill I I I I I || | I | | | | 

ATCAGATCTMGCTTGCATGCCTGCAGGTCGACTCTAGAGGATCCCCGGGTACCGAGCTCGAATTCACTGGCCGTCGTTT 2400 

Ui TAGTCTAGATTCGAACGTACGGACGTCCAGCTGAGATCTCCTAGGGGCCCATGGCTCGAGCTTAAGTGACCGGCAGCAAA 
sjl I I I I I I I • I I I I I • I I I I I I I I I I I I I I • | | | | | 

r. 2324 2336 2344 2354 2361 2369 2379 2387 

!l 2324 2336 2348 2355 2364 2375 2389 

W« 2325 2336 2348 2358 2365 2375 2389 

«0 2325 2336 2348 2355 2365 2375 2390 

2325 2337 2349 2360 2369 2376 2391 

m 2325 2341 2360 2369 2381 

2328 2342 2351 2360 2369 2381 

HF 2330 2342 2351 2360 2370 

U 2331 2342 2361 2370 

<¥* 2361 2375 

2361 2375 
« 2361 

2365 
2365 
2365 
2365 
2365 
2365 
2365 

2366 

2366 

2366 

2366 

2366 

236C 

Mae II 2366 
ScrF H 
EcoR n 
Dsa 2 

. BstN I Fok I Alu I 

Mae HI BstK I Fnu4H I Pvu II 

Mae II Bsr I BsaJ I Mse I Bbv I NspB II 

III II I I II II 

TACAACGTCGTGACTGGGAAMCCCTGGCGTTACCCAACTTMTCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGT 2480 
ATGnGCAGCACTGACCCTTTTGGGACCGCAATGGGTTGAATTAGCGGAACGTCGTGTAGGGGGAAAGCGGTCGACCGCA 
III 'II I I *l I • «| | 

2405 2413 2423 2440 2451 2471 

2410 2424 2451 2471 

2424 2457 2472 

i«i FIG. IOB 
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Construction: pGP 704 from De Lorenzo, with transposose 
gene and insertion of the Iinker[l-Scel] in NotI unique site 
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. 1L .Construction: pD 123, from J.D. Boeke 
with insertion of a Hnker[I-SceI-NotI] in BomHI 
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